Keynote: Statistical Approaches to Open-domain Spoken Dialogue Systems
نویسنده
چکیده
In contrast to traditional rule-based approaches to building spoken dialogue systems, recent research has shown that it is possible to implement all of the required functionality using statistical models trained using a combination of supervised learning and reinforcement learning. This approach to spoken dialogue is based on the mathematics of partially observable Markov decision processes (POMDPs) in which user inputs are treated as observations of some underlying belief state, and system responses are determined by a policy which maps belief states into actions. Virtually all current spoken dialogue systems are designed to operate in either a specific carefully defined domain such as restaurant information and appointment booking, or they have very limited conversational ability such as in Siri and Google Now. However, if voice is to become a significant input modality for accessing web-based information and services, then techniques will be needed to enable conversational spoken dialogue systems to operate within open domains. This talk will discuss methods by which current statistical approaches to spoken dialogue can be extended to cover much wider domains. It will be argued that unlike many other areas of machine learning, spoken dialogue systems always have a user on-hand to provide supervision. Hence spoken dialogue systems provide a unique opportunity to automatically adapt on large quantities of speech data without the need for costly annotation.
منابع مشابه
PyDial: A Multi-domain Statistical Dialogue System Toolkit
Statistical Spoken Dialogue Systems have been around for many years. However, access to these systems has always been difficult as there is still no publicly available end-to-end system implementation. To alleviate this, we present PyDial, an opensource end-to-end statistical spoken dialogue system toolkit which provides implementations of statistical approaches for all dialogue system modules....
متن کاملWord order variations and spoken man-machine dialogue in French : a corpus analysis on the ATIS domain
During the last decade, spoken man-machine dialogue has known significant improvements that should lead shortly to the development of real use systems. In spite of these indisputable advances, numerous limitations restrict still the expansion of common use spoken dialogue systems. In particular, present researches in spoken man-machine communication lack seriously genericity. Most of spoken dia...
متن کاملRobust and adaptive architecture for multilingual spoken dialogue systems
We present how robustness and adaptivity can be supported by the spoken dialogue system architecture. AthosMail is a multilingual spoken dialogue system for e-mail domain. It is being developed in the EU-funded DUMAS project. It has flexible system architecture supporting multiple components for input interpretation, dialogue management and output generation. In addition to language differences...
متن کاملTowards Learning Transferable Conversational Skills using Multi-dimensional Dialogue Modelling
Statistical approaches to dialogue management have brought improvements in robustness and scalability of spoken dialogue systems, but still rely heavily on in-domain data, thus limiting their crossdomain scalability. In this paper, we present a new multi-dimensional, statistical dialogue management framework, in which transferable conversational skills can be learnt by separating out domaininde...
متن کاملDialogue-Oriented Review Summary Generation for Spoken Dialogue Recommendation Systems
In this paper we present an opinion summarization technique in spoken dialogue systems. Opinion mining has been well studied for years, but very few have considered its application in spoken dialogue systems. Review summarization, when applied to real dialogue systems, is much more complicated than pure text-based summarization. We conduct a systematic study on dialogue-system-oriented review a...
متن کامل